Comment: Struggles with Survey Weighting and Regression Modeling
نویسندگان
چکیده
We congratulate the author on an informative and thought-provoking discussion on a topic of broad interest to the statistics community: the fitting of models to data collected through complex surveys. The number of papers written on this topic, whether from a model-based or design-based perspective, is substantial and goes back at least to Konijn (1962). This topic has led to some disagreements between those advocating that the design best be ignored when the primary interest is on the characteristics of the model, and those stating that the design cannot be ignored. More recently, both sides of this discussion have moved to something approaching a consensus, with those favoring a model-based approach acknowledging the need to account for nonignorable designs in the model fitting, while the traditional design-based view has been extended to explore certain circumstances under which it is appropriate to ignore the design. The current article is an excellent example of those recent discussions of why the design needs to be accounted for in modeling, and how this can be done in practice. The importance of fully accounting for the design by incorporating all relevant interactions provides a good motivation for the discussion of the range of methods in the article. It also stresses other aspects of importance to people working with survey data, in particular the desirability of maintaining scale/location invariance and linearity of the modelbased estimators. This ensures consistency of estimates for different variables in the survey, as well as
منابع مشابه
Comment: Struggles with Survey Weighting and Regression Modeling
Andrew Gelman’s article “Struggles with survey weighting and regression modeling” addresses the question of what approach analysts should use to produce estimates (and associated estimates of variability) based on sample survey data. Gelman starts by asserting that survey weighting is a “mess.” While we agree that incorporation of the survey design for regression remains challenging, with impor...
متن کاملComment: Struggles with Survey Weighting and Regression Modeling
I appreciate the opportunity to comment on Andrew Gelman’s interesting paper. As an admirer of Gelman’s work, it is a pleasure to read his take on the topic of survey weighting, which I have always found fascinating. Since I support Gelman’s general approach, I focus on reinforcing some points in the article and commenting on some of the modeling issues he raises. As a student of statistics, I ...
متن کاملComment: Struggles with Survey Weighting and Regression Modeling
In the ideal samples of survey sampling textbooks, weights are the inverses of the inclusion probabilities for the units. But nonresponse and undercoverage occur, and survey statisticians try to compensate for the resulting bias by adjusting the sampling weights. There has been much debate about when and whether weights should be used in analyses, and how they should be constructed. Professor G...
متن کاملComment: Struggles with Survey Weighting and Regression Modeling
This is an intriguing paper that raises important questions, and I feel privileged for being invited to discuss it. The paper deals with a very basic problem of sample surveys: how to weight the survey data in order to estimate finite population quantities of interest like means, differences of means or regression coefficients. The paper focuses for the most part on the common estimator of a po...
متن کاملRejoinder: Struggles with Survey Weighting and Regression Modeling
Encountering this sort of statement in the documentation of opinion poll data we were analyzing in political science: “A weight is assigned to each sample record, and MUST be used for all tabulations.” (This particular version was in the codebook for the 1988 CBS News/New York Times Poll; as you can see, this is a problem that has been bugging me for a long time.) Computing weighted averages is...
متن کاملStruggles with Survey Weighting and Regression Modeling
The general principles of Bayesian data analysis imply that models for survey responses should be constructed conditional on all variables that affect the probability of inclusion and nonresponse, which are also the variables used in survey weighting and clustering. However, such models can quickly become very complicated, with potentially thousands of post-stratification cells. It is then a ch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008